Pattern filtering and classification for market basket analysis with profit-based measures

نویسندگان

  • Mu-Chen Chen
  • Chuang-Min Chao
  • Kuan-Ting Wu
چکیده

Market basket analysis is one of the typical applications in mining association rules. The valuable information discovered from data mining can be used to support decision making. Generally, support and confidence (objective) measures are used to evaluate the interestingness of association rules. However, in some cases, by using these two measures, the discovered rules may be not profitable and not actionable (not interesting) to enterprises. Therefore, how to discover the patterns by considering both objective measures (e.g. probability) and subjective measures (e.g. profit) is a challenge in data mining, particularly in marketing applications. This paper focuses on pattern evaluation in the process of knowledge discovery by using the concept of profit mining. Data Envelopment Analysis is utilized to calculate the efficiency of discovered association rules with multiple objective and subjective measures. After evaluating the efficiency of association rules, they are categorized into two classes, relatively efficient (interesting) and relatively inefficient (uninteresting). To classify these two classes, Decision Tree (DT)-based classifier is built by using the attributes of association rules. The DT classifier can be used to find out the characteristics of interesting association rules, and to classify the unknown (new) association rules.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Dynamic Analysis of Market Efficiency on Benchmark Crude oil markets: Based on the Adaptive Market Hypothesis

This paper examines the applicability of the adaptive market hypothesis (AMH) as an evolutionary alternative to the efficient market hypothesis (EMH) by studying daily returns on the three benchmark crude oils. The data coverage of daily returns is from January 2th 2003 to March 5th 2018. In this paper, two different tests in the form of two distinguished classes (linear and nonlinear) have bee...

متن کامل

An Analysis of ‘Triangle Ordering’ in Foreign Exchange Market (Forex): Simultaneous Ordering of Three Major Currency Pairs

With considering a ‘triangle of three major currency pairs’, there is a tiny difference between multiplication of exchange rate for the first two currency pairs and the third. To discover whether this little difference can lead to a neutral arbitrage or not, I took portfolios of 35 baskets of three major currency pairs(combinations of all 7 major currencies). There are eight approaches (differe...

متن کامل

Supervised Learning-Based Collaborative Filtering Using Market Basket Data for the Cold- Start Problem

The market basket data in the form of a binary user-item matrix or a binary item-user matrix can be modelled as a binary classification problem. The binary logistic regression approach tackles the binary classification problem, where principal components are predictor variables. If users or items are sparse in the training data, the binary classification problem can be considered as a cold-star...

متن کامل

Ranking discovered rules from data mining with multiple criteria by data envelopment analysis

In data mining applications, it is important to develop evaluation methods for selecting quality and profitable rules. This paper utilizes a non-parametric approach, Data Envelopment Analysis (DEA), to estimate and rank the efficiency of association rules with multiple criteria. The interestingness of association rules is conventionally measured based on support and confidence. For specific app...

متن کامل

Classification-based collaborative filtering using market basket data

Collaborative filtering based on voting scores has been known to be the most successful recommendation technique and has been used in a number of different applications. However, since voting scores are not easily available, similar techniques should be needed for the market basket data in the form of binary user-item matrix. We viewed this problem as a two-class classification problem and prop...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Expert Systems

دوره 29  شماره 

صفحات  -

تاریخ انتشار 2012